NSF PAR Search | NSF Public Access Repository


Implicit Values Embedded in How Humans and LLMs Complete Subjective Everyday Tasks

Arunasalam, Arjun; Pickering, Madison; Celik, Z Berkay; Ur, Blase (November 2025, Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing)

Large language models (LLMs) can underpin AI assistants that help users with everyday tasks, such as by making recommendations or performing basic computation. Despite AI assistants’ promise, little is known about the implicit values these assistants display while completing subjective everyday tasks. Humans may consider values like environmentalism, charity, and diversity. To what extent do LLMs exhibit these values in completing everyday tasks? How do they compare with humans? We answer these questions by auditing how six popular LLMs complete 30 everyday tasks, comparing LLMs to each other and to 100 human crowdworkers from the US. We find LLMs often do not align with humans, nor with other LLMs, in the implicit values exhibited.
Full Text Available
What Does It Mean to Be Creepy? Responses to Visualizations of Personal Browsing Activity, Online Tracking, and Targeted Ads

https://doi.org/10.56553/popets-2024-0101

Reitinger, Nathan; Wen, Bruce; Mazurek, Michelle L; Ur, Blase (July 2024, Proceedings on Privacy Enhancing Technologies)

Internet companies routinely follow users around the web, building profiles for ad targeting based on inferred attributes. Prior work has shown that these practices, generally, are creepy—but what does that mean? To help answer this question, we substantially revised an open-source browser extension built to observe a user's browsing behavior and present them with a tracker's perspective of that behavior. Our updated extension models possible interest inferences far more accurately, integrates data scraped from the user's Google ad dashboard, and summarizes ads the user was shown. Most critically, it introduces ten novel visualizations that show implications of the collected data, both the mundane (e.g., total number of ads you've been served) and the provocative (e.g., your interest in reproductive health, a potentially sensitive topic). We use our extension as a design probe in a week-long field study with 200 participants. We find that users do perceive online tracking as creepy—but that the meaning of creepiness is far from universal. Participants felt differently about creepiness even when their data presented similar visualizations, and even when responding to the most potentially provocative visualizations—in no case did more than 66% of participants agree that any one visualization was creepy.
Full Text Available
Data Subjects’ Reactions to Exercising Their Right of Access

Borem, Arthur; Pan, Elleen; Obielodan, Olufunmilola; Roubinowitz, Aurelie; Dovichi, Luca; Mazurek, Michelle L.; Ur, Blase (August 2024, Proceedings of the 33rd USENIX Security Symposium)

Recent privacy laws have strengthened data subjects’ right to access personal data collected by companies. Prior work has found that data exports companies provide consumers in response to Data Subject Access Requests (DSARs) can be overwhelming and hard to understand. To identify directions for improving the user experience of data exports, we conducted an online study in which 33 participants explored their own data from Amazon, Facebook, Google, Spotify, or Uber. Participants articulated questions they hoped to answer using the exports. They also annotated parts of the data they found confusing, creepy, interesting, or surprising. While participants hoped to learn either about their own usage of the platform or how the company collects and uses their personal data, these questions were often left unanswered. Participants’ annotations documented their excitement at finding data records that triggered nostalgia, but also shock about the privacy implications of other data they saw. Having examined their data, many participants hoped to request the company erase some, but not all, of the data. We discuss opportunities for future transparency-enhancing tools and enhanced laws.
Full Text Available
Data Subjects' Reactions to Exercising Their Right of Access

Borem, Arthur; Pan, Elleen; Obielodan, Olufunmilola; Roubinowitz, Aurelie; Dovichi, Luca; Mazurek, Michelle L; Ur, Blase (August 2024, USENIX Security Symposium)

Recent privacy laws have strengthened data subjects' right to access personal data collected by companies. Prior work has found that data exports companies provide consumers in response to Data Subject Access Requests (DSARs) can be overwhelming and hard to understand. To identify directions for improving the user experience of data exports, we conducted an online study in which 33 participants explored their own data from Amazon, Facebook, Google, Spotify, or Uber. Participants articulated questions they hoped to answer using the exports. They also annotated parts of the export they found confusing, creepy, interesting, or surprising. While participants hoped to learn either about their own usage of the platform or how the company collects and uses their personal data, these questions were often left unanswered. Participants' annotations documented their excitement at finding data records that triggered nostalgia, but also shock and anger about the privacy implications of other data they saw. Having examined their data, many participants hoped to request the company erase some, but not all, of the data. We discuss opportunities for future transparency-enhancing tools and enhanced laws.
Full Text Available
Can Allowlists Capture the Variability of Home IoT Device Network Behavior?

https://doi.org/10.1109/EuroSP60621.2024.00015

He, Weijia; Bryson, Kevin; Calderon, Ricardo; Prakash, Vijay; Feamster, Nick; Huang, Danny Yuxing; Ur, Blase (July 2024, IEEE)

Full Text Available
JupyterLab in Retrograde: Contextual Notifications That Highlight Fairness and Bias Issues for Data Scientists

Harrison, Galen; Bryson, Kevin; Bamba, Ahmad Emmanuel; Dovichi, Luca; Binion, Aleksander Herrmann; Borem, Arthur; Ur, Blase (May 2024, Proceedings of the CHI Conference on Human Factors in Computing Systems)

Current algorithmic fairness tools focus on auditing completed models, neglecting the potential downstream impacts of iterative decisions about cleaning data and training machine learning models. In response, we developed Retrograde, a JupyterLab environment extension for Python that generates real-time, contextual notifications for data scientists about decisions they are making regarding protected classes, proxy variables, missing data, and demographic differences in model performance. Our novel framework uses automated code analysis to trace data provenance in JupyterLab, enabling these notifications. In a between-subjects online experiment, 51 data scientists constructed loan-decision models with Retrograde providing notifications continuously throughout the process, only at the end, or never. Retrograde’s notifications successfully nudged participants to account for missing data, avoid using protected classes as predictors, minimize demographic differences in model performance, and exhibit healthy skepticism about their models.
Full Text Available
Analysis of Google Ads Settings Over Time: Updated, Individualized, Accurate, and Filtered

https://doi.org/10.1145/3603216.3624968

Reitinger, Nathan; Wen, Bruce; Mazurek, Michelle L.; Ur, Blase (November 2023, Proceedings of the 21st Workshop on Privacy in the Electronic Society)

Advertising companies and data brokers often provide consumers access to a dashboard summarizing attributes they have collected or inferred about that user. These attributes can be used for targeted advertising. Several studies have examined the accuracy of these collected attributes or users’ reactions to them. However, little is known about how these dashboards, and the associated attributes, change over time. Here, we report data from a week-long, longitudinal study (n = 158) in which participants used a browser extension automatically capturing data from one dashboard, Google Ads Settings, after every fifth website the participant visited. The results show that Ads Settings is frequently updated, includes many attributes unique to only a single participant in our sample, and is approximately 90% accurate when assigning age and gender. We also find evidence that Ads Settings attributes may dynamically impact browsing behavior and may be filtered to remove sensitive interests.
Full Text Available
Evaluation of Ad Transparency Systems

Bryson, Kevin; Borem, Arthur; Moh, Phoebe; Akgul, Omer; Edelson, Laura; Geeng, Chris; Lauinger, Tobias; Mazurek, Michelle L.; McCoy, Damon; Ur, Blase (May 2024, ConPro 2024: IEEE SPW Workshop on Technology and Consumer Protection)

In this research proposal, we outline our plans to examine the characteristics and affordances of ad transparency systems provided by 22 online platforms. We outline a user study designed to evaluate the usability of eight of these systems by studying the actions and behaviors each system enables, as well as users' understanding of these transparency systems.
Full Text Available
Summarizing Sets of Related ML-Driven Recommendations for Improving File Management in Cloud Storage

https://doi.org/10.1145/3526113.3545704

Brackenbury, Will; Chard, Kyle; Elmore, Aaron; Ur, Blase (October 2022, Proceedings of the 35th ACM Symposium on User Interface Software and Technology (UIST))

Personal cloud storage systems increasingly offer recommendations to help users retrieve or manage files of interest. For example, Google Drive's Quick Access predicts and surfaces files likely to be accessed. However, when multiple, related recommendations are made, interfaces typically present recommended files and any accompanying explanations individually, burdening users. To improve the usability of ML-driven personal information management systems, we propose a new method for summarizing related file-management recommendations. We generate succinct summaries of groups of related files being recommended. Summaries reference the files' shared characteristics. Through a within-subjects online study in which participants received recommendations for groups of files in their own Google Drive, we compare our summaries to baselines like visualizing a decision tree model or simply listing the files in a group. Compared to the baselines, participants expressed greater understanding and confidence in accepting recommendations when shown our novel recommendation summaries.
Full Text Available
Defining "Broken": User Experiences and Remediation Tactics When Ad-Blocking or Tracking-Protection Tools Break a Website’s User Experience

Nisenoff, Alexandra; Borem, Arthur; Pickering, Madison; Nakanishi, Grant; Thumpasery, Maya; Ur, Blase (January 2023, Proceedings of the 32nd USENIX Security Symposium)

To counteract the ads and third-party tracking ubiquitous on the web, users turn to blocking tools: ad-blocking and tracking-protection browser extensions and built-in features. Unfortunately, blocking tools can cause non-ad, non-tracking elements of a website to degrade or fail, a phenomenon termed breakage. Examples include missing images, non-functional buttons, and pages failing to load. While the literature frequently discusses breakage, prior work has not systematically mapped and disambiguated the spectrum of user experiences subsumed under "breakage," nor sought to understand how users experience, prioritize, and attempt to fix breakage. We fill these gaps. First, through qualitative analysis of 18,932 extension-store reviews and GitHub issue reports for ten popular blocking tools, we developed novel taxonomies of 38 specific types of breakage and 15 associated mitigation strategies. To understand subjective experiences of breakage, we then conducted a 95-participant survey. Nearly all participants had experienced various types of breakage, and they employed an array of strategies of variable effectiveness in response to specific types of breakage in specific contexts. Unfortunately, participants rarely notified anyone who could fix the root causes. We discuss how our taxonomies and results can improve the comprehensiveness and prioritization of ongoing attempts to automatically detect and fix breakage.
Full Text Available
